r 



G. SEQUENCE LISTING 

5 (1) GENERAL INFORMATION 

(i) APPLICANT; Darrall Anderson. Nabil Hanna. John Leonard, 

Roland Newman and Mitchell RefFand William H. 
Rastetter 

10 

(ii) TITLE OF INVENTION: THERAPEUTIC APPLICATION OF 

CHIMERIC AND RADIOLABELED 
ANTIBODIES TO HUMAN B 
LYMPHOCYTE RESTRICTED 
15 DIFFERENTIATION ANTIGEN FOR 

TREATMENT OF B CELL LYMPHOMA 

(iii) NUMBER OF SEQUENCES: 8 
20 (iv) CORRESPONDING ADDRESS: 

(A) ADDRESSEE: IDEC Pharmaceuticals Corporation 

(B) STREET: 11011 Torreyana Road 

(C) CITY: San Diego 
25 (D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 92121 

(v) COMPUTER READABLE FORM: 

30 

(A) MEDIUM TYPE: Diskette, 3.5 inch, 1.44 Mb 

(B) COMPUTER: Macintosh 

(C) OPERATING SYSTEM: MS.DOS 

(D) SOFTWARE: Microsoft Word 5.0 

35 

(vi CURRE>Pr APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

40 (C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Burgoon, Richard P. Jr. 

45 (B) REGISTRATION NUMBER: 34,787 

(C) REFERENCE/DOCKET NUMBER: 

(ix) TELECOMMUNICATION INFORMATION: 

50 (A) TELEPHONE: (619)550-8500 

(B) TELEFAX: (619)550-8750 
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(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8540 bases 

<B) TYPE: nuolaicftpid 

(C) STRANDEDNB5SS: single 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 
15 (iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

20 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

25 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 3 60 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG 1020 

CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA 1080 

55 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1140 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1200 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1260 



60 
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AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1320 

TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1380 

TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC 1440 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT 1500 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 1560 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 1620 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 1680 

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG 1740 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 1800 

TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 1860 

GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC 1920 

CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 1980 

GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 2040 

TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC 2100 

ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC 2160 

AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 2220 

GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 2280 

GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 2340 

TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC 2400 

AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGCAGAGCC CAAATCTTGT 2460 

GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 2520 

TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 2580 

45 TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 2640 

GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 2700 

CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 2760 

50 

TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 2820 

GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 2880 

55 AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 2940 

TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 3000 

GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AOAGCAGGTG GCAGCAGGGG 3060 

60 

AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 3120 

CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT 3180 
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TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT 3240 

GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC 3300 

5 

CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT 3360 

efiit'Wtvwia actn-fnui^iiiira ueiHPAtaiaiAt^A cteAActc^auuA ooAtoftMsieMUk SMaAAirAot^A 3A«e 

10 GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT 3480 

CCCGATCCCC AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3540 

AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3 600 

15 

AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3 660 

GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3720 

'^^S GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 3780 

--'-^ GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3840 

CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3900 

:=t 25 

■'^ CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3960 

Sfi GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4020 

30 GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 

';==: TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 4140 

CCTGAGAAGA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 4200 

GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 4260 

GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 4320 

TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 43 80 

GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 

GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4500 

GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560 

CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620 

50 TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4680 

TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 47 40 

GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 

55 

CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 

TAGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4920 

60 ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980 

TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040 
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GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 

CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 

5 AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 

TTGTGTTGGG AGCTTGGATC GATCCTCTAT GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 

TCcaoccacT TaesaTOOAOA accTATreaei eTATOACToa gcacaacaqa caatcocicto 3340 

10 

CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC 5400 

CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC 54 60 

15 CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG 5520 

GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA 5580 

GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG 5640 

20 

CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG 5700 

TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT 5760 

25 CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC 5820 

CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG 5880 

GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA 5940 

GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC 6000 

GCAGCGCATC GCCTTCTATC GCCTTCTTGA CGAGTTCTTC TGAGCGGGAC TCTGGGGTTC 6060 

GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC 6120 

TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG 6180 

CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT 6240 

GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT 6300 

TCTAGTTGTG GTTTGTCCAA ACTCATCAAT CTATCTTATC ATGTCTGGAT CGCGGCCGCG 6360 

ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC 6420 

CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 6480 

AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 6540 

ACCTGTCGTG CCAGCTGCAT TAATGAAtCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 6600 

TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 6660 

GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 6720 

CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 6780 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA 6840 

GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 6900 

CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 6960 
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CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAG7 TCGGTGTAGG 7020 

TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 7080 

TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 7140 

CAQCCACTOO TAACAOQATT AOCAnftOrrift aOTATaTAqo CQO'raQT&CA CAaTTCTTaA 7 3 oo 

AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 7 2 60 

AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 73 20 

GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 73 80 

AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 744 0 

GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 7500 

GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 7560 

TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7620 

TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7680 

TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 774 0 

GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 7800 

GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 7860 

TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7920 

CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7980 

TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 8040 

CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100 

AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160 

CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 822 0 

AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 8280 

AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 8340 

GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 8400 

GAATACTCAT ACTCTTCCTT TTTCAATATT AtTGAAGCAT TTATCAGGGT TATTGTCTCA 8460 

TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 8520 

TTCCCCGAAA AGTGCCACCT 8540 



(3) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 9209 bases 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: circular 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: y«« 

(iv) ANTI-SENSE: no 

10 

(ix) SEQUENCE DESCRIPTION: SEQIDNO: 2: 

GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 

15 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180 

20 ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 3 60 

25 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

30 ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

35 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 

40 TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960 

45 

CATCACAGAT CTCTCACTAT GGATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 1020 

GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080 

50 GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 1140 

CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 

CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 1260 

55 

ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 1320. 

AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 1380 

60 TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1440 
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AGCACGTACC 
GAGTACAAGT 
AAAGCCAAAG 

CTCACCAAQA 

GCCGTGGAGT 
CTGGACTCCG 
CAGCAGGGGA 
CAGAAGAGCC 
AGACTGGATT 
CTTCTAGTTG 
GTGCCACTCC 
GGTGTCATTC 
ACAATAGCAG 
GCTGGATCTC 
AGGAAAATTA 
ATGCTTTAGA 
CAGAGCTGAG 
ATCACCGAAG 
ATAGAGAGGG 
CATTTGCTTC 
ATTTCGCGCC 
GCCATCATGG 
AAGAACGGAG 
ACCACAACCT 
TTCTCCATTC 
GAACTCAAAG 
AGACTTATTG 
AGTTCTGTTT 
ATCATGCAGG 
CTTCTCCCAG 
AAGTTTGAAG 



GTGTGGTCAG 
GCAAGGTCTC 
GGCAGCCCCG 

AGCAOQTCMl 

GGGAGAGCAA 
ACGGCTCCTT 
ACGTCTTCTC 
TCTCCCTGTC 
CGTGACAACA 
CCAGCCATCT 
CACTGTCCTT 
TATTCTGGGG 
GCATGCTGGG 
CCGATCCCCA 
ATTTTAACAC 
GACAGTGTTC 
ACTCCTAAGC 
CCTGATTCCG 
CAGGAGCCAG 
TGACATAGTT 
AAACTTGACG 
TTCGACCATT 
ACCTACCCTG 
CTTCAGTGGA 
CTGAGAAGAA 
AACCACCACG 
AACAACCGGA 
ACCAGGAAGC 
AATTTGAAAG 
AATACCCAGG 
TCTACGAGAA 



CGTCCTCACC 
CAACAAAGCC 
AGAACCACAG 

GCTOACCTOC 

TGGGCAGCCG 
CTTCCTCTAC 
ATGCTCCGTG 
TCCGGGTAAA 
TGCGGCCGTG 
GTTGTTTGCC 
TCCTAATAAA 
GGTGGGGTGG 
GATGCGGTGG 
GCTTTGCTTC 
CAATTCAGTA 
TCTGCACAGA 
CAGTGAGTGG 
TAGAGCCACA 
GGCAGAGCAT 
GTGTTGGGAG 
GCAATCCTAG 
GAACTGCATC 
GCCTCCGCTC 
AGGTAAACAG 
TCGACCTTTA 
AGGAGCTCAT 
ATTGGCAAGT 
CATGAATCAA 
TGACACGTTT 
CGTCCTCTCT 
GAAAGACTAA 



GTCCTGCACC 
CTCCCAGCCC 
GTGTACACCC 

GAGAACAACT 
AGCAAGCTCA 
ATGCATGAGG 
TGAGGATCCG 
ATATCTACGT 
CCTCCCCCGT 
ATGAGGAAAT 
GGCAGGACAG 
GCTCTATGGA 
TCAATTTCTT 
GTTGATTGAG 
TAAGGACAAA 
CACAGCATTC 
CCTTGGTAAG 
ATAAGGTGAG 
CTTGGATAGC 
CGTGAAGGCT 
GTCGCCGTGT 
AGGAACGAGT 
AATCTGGTGA 
AAGGACAGAA 
TTTCTTGCCA 
AAAGTAGACA 
CCAGGCCACC 
TTCCCAGAAA 
GAGGTCCAGG 
CAGGAAGATG 



AGGACTGGCT 
CCATCGAGAA 
TGCCCCCATC 

QCTTCTATCC 

ACAAGACCAC 
CCGTGGACAA 
CTCTGCACAA 
TTAACGGTTA 
ATGATCAGCC 
GCCTTCCTTG 
TGCATCGCAT 
CAAGGGGGAG 
ACCAGCTGGG 
ATTTGCATAA 
CAAATGCGTT 
CATTATTCAG 
TAGGGAGAAA 
GGCCAATCTG 
GTAGGATCAG 
TTGGACAGCT 
GGTAGGATTT 
CCCAAAATAT 
TCAAGTACTT 
TTATGGGTAG 
TTAATATAGT 
AAAGTTTGGA 
TGGTTTGGAT 
TTAGACTCTT 
TTGATTTGGG 
AGGAAAAAGG 
CTTTCAAGTT 



GAATGGCAAG 
AACCATCTCC 
CCGGGATGAG 
cAQcaACATc: 
GCCTCCCGTG 
GAGCAGGTGG 
CCACTACACG 
CCAACTACCT 
TCGACTGTGC 
ACCCTGGAAG 
TGTCTGAGTA 
GATTGGGAAG 
GCTCGACAGC 
TGAGAAAAAA 
GCCAAAAAGG 
AGGGAGTACC 
TATGCTTGTC 
CTCACACAGG 
TTGCTCCTCA 
CAGGGCTGCG 
TATCCCCGCT 
GGGGATTGGC 
CCAAAGAATG 
GAAAACCTGG 
TCTCAGTAGA 
TGATGCCTTA 
AGTCGGAGGC 
TGTGACAAGG 
GAAATATAAA 
CATCAAGTAT 
CTCTGCTCCC 



3420 
3480 
3540 

3600 

3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
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CTCCTAAAGC TATGCATTTT TATAAGACCA 
GACTGTGCCT TCTAGTTGCC AGCCATCTGT 
5 CCTGGAAGGT GCCACTCCCA CTGTCCTTTC 

TCTGAGTAGG TGTCATTCTA TTCTGGGGGG 
TTCGOAAOAc: AATAGCAaaC AliOCTaOCiaA 

10 

TCGAGCTACT AGCTTTGCTT CTCAATTTCT 
AATTTTAACA CCAATTCAGT AGTTGATTGA 
15 AGACAGTGTT CTCTGCACAG ATAAGGACAA 

GACTCCTAAG CCAGTGAGTG GCACAGCATT 
GCCTGATTCC GTAGAGCCAC ACCTTGGTAA 

20 

GCAGGAGCCA GGGCAGAGCA TATAAGGTGA 
CTGACATAGT TGTGTTGGGA GCTTGGATCG 
25 CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG 

AATCGGCTGC TCTGATGCCG CCGTGTTCCG 
TGTCAAGACC GACCTGTCCG GTGCCCTGAA 

30 

GTGGCTGGCC ACGACGGGCG TTCCTTGCGC 
AAGGGACTGG CTGCTATTGG GCGAAGTGCC 
35 TCCTGCCGAG AAAGTATCCA TCATGGCTGA 

GGCTACCTGC CCATTCGACC ACCAAGCGAA 
GGAAGCCGGT CTTGTCGATC AGGATGATCT 

40 

CGAACTGTTC GCCAGGCTCA AGGCGCGCAT 
TGGCGATGCC TGCTTGCCGA ATATCATGGT 
45 CTGTGGCCGG CTGGGTGTGG CGGACCGCTA 

TGCTGAAGAG CTTGGCGGCG AATGGGCTGA 
TCCCGATTCG CAGCGCATCG CCTTCTATCG 

50 

CTGGGGTTCG AAATGACCGA CCAAGCGACG 
ACCGCCGCCT TCTATGAAAG GTTGGGCTTC 
55 ATCCTCCAGC GCGGGGATCT CATGCTGGAG 

GCTTATAATG GTTACAAATA AAGCAATAGC 
TCACTGCATT CTAGTTGTGG TTTGTCCAAA 

60 

GCGGCCGCGA TCCCGTCGAG AGCTTGCCGT 
ATTGTTATCC GCTCACAATT CCACACAACA 




r 

TGGGACTTTT GCTGGCTTTA GATCAGCCTC 52 80 

TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC 5340 

CTAATAAAAT GAGGAAATTG CATCGCATTG 5400 

TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 5460 

WCOaTcSGOC TCTATGGAAC CAQCTOaaCSC 5520 

TATTTGCATA ATGAGAAAAA AAGGAAAATT 5580 

GCAAATGCGT TGCCAAAAAG GATGCTTTAG 5640 

ACATTATTCA GAGGGAGTAC CCAGAGCTGA 5700 

CTAGGGAGAA ATATGCTTGT CATCACCGAA 5760 

GGGCCAATCT GCTCACACAG GATAGAGAGG 5820 

GGTAGGATCA GTTGCTCCTC ACATTTGCTT 5880 

ATCCTCTATG GTTGAACAAG ATGGATTGCA 5940 

GCTATTCGGC TATGACTGGG CACAACAGAC 6000 

GCTGTCAGCG CP^GGGGCGCC CGGTTCTTTT 6060 

TGAACTGCAG GACGAGGCAG CGCGGCTATC 6120 

AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 6180 

GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 6240 

TGCAATGCGG CGGCTGCATA CGCTTGATCC 6300 

ACATCGCATC GAGCGAGCAC GTACTCGGAT 6360 

GGACGAAGAG CATCAGGGGC TCGCGCCAGC 6420 

GCCCGACGGC GAGGATCTCG TCGTGACCCA 6480 

GGAAAATGGC CGCTTTTCTG GATTCATCGA 6540 

TCAGGACATA GCGTTGGCTA CCCGTGATAT 6600 

CCGCTTCCTC GTGCTTTACG GTATCGCCGC 6660 

CCTTCTTGAC GAGTTCTTCT GAGCGGGACT 6720 

CCCAACCTGC CATCACGAGA TTTCGATTCC 6780 

GGAATCGTTT TCCGGGACGC CGGCTGGATG 6840 

TTCTTCGCCC ACCCCAACTT GTTTATTGCA 6900 

ATCACAAATT TCACAAATAA AGCATTTTTT 6960 

CTCATCAATC TATCTTATCA TGTCTGGATC 7020 

AATCATGGTC ATAGCTGTTT CCTGTGTGAA 7080 

TACGAGCCGG AAGCATAAAG TGTAAAGCCT 7140 
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GGGCSTGCCTA ATGAGTGAGC TAACTCACAT TAATTGCGTT GCGCTCACTG CCCGCTTTCC 7200 

AGTCGGGAAA CCTGTCGTGC CAGCTGCATT AATGAATCGG CCAACGCGCG GGGAGAGGCG 7260 

GTTTGCGTAT TGGGCGCTCT TCCGCTTCCT CGCTCACTGA CTCGCTGCGC TCGGTCGTTC 7320 

QQCTQCQOCO ACCOCTATCA OCTCACTCAA AQOCOOTAAT ACOOTTATCC ACAOAATCAO 73 BO 

GGGATAACGC AGGAAAGAAC ATGTGAGCAA AAGGCCAGCA AAAGGCCAGG AACCGTAAAA 7440 

AGGCCGCGTT GCTGGCGTTT TTCCATAGGC TCCGCCCCCC TGACGAGCAT CACAAAAATC 7500 

GACGCTCAAG TCAGAGGTGG CGAAACCCGA CAGGACTATA AAGATACCAG GCGTTTCCCC 7560 

CTGGAAGCTC CCTCGTGCGC TCTCCTGTTC CGACCCTGCC GCTTACCGGA TACCTGTCCG 7620 

CCTTTCTCCC TTCGGGAAGC GTGGCGCTTT CTCAATGCTC ACGCTGTAGG TATCTCAGTT 7680 

20 CGGTGTAGGT CGTTCGCTCC AAGCTGGGCT GTGTGCACGA ACCCCCCGTT CAGCCCGACC 7740 

GCTGCGCCTT ATCCGGTAAC TATCGTCTTG AGTCCAACCC GGTAAGACAC GACTTATCGC 7800 

CACTGGCAGC AGCCACTGGT AACAGGATTA GCAGAGCGAG GTATGTAGGC GGTGCTACAG 7860 

25 

AGTTCTTGAA GTGGTGGCCT AACTACGGCT ACACTAGAAG GACAGTATTT GGTATCTGCG 7920 

CTCTGCTGAA GCCAGTTACC TTCGGAAAAA GAGTTGGTAG CTCTTGATCC GGCAAACAAA 7980 

30 CCACCGCTGG TAGCGGTGGT TTTTTTGTTT GCAAGCAGCA GATTACGCGC AGAAAAAAAG 8040 

GATCTCAAGA AGATCCTTTG ATCTTTTCTA CGGGGTCTGA CGCTCAGTGG AACGAAAACT 8100 

CACGTTAAGG GATTTTGGTC ATGAGATTAT CAAAAAGGAT CTTCACCTAG ATCCTTTTAA 8160 

35 

ATTAAAAATG AAGTTTTAAA TCAATCTAAA GTATATATGA GTAAACTTGG TCTGACAGTT 8220 

ACCAATGCTT AATCAGTGAG GCACCTATCT CAGCGATCTG TCTATTTCGT TCATCCATAG 8280 

40 TTGCCTGACT CCCCGTCGTG TAGATAACTA CGATACGGGA GGGCTTACCA TCTGGCCCCA 8340 

GTGCTGCAAT GATACCGCGA GACCCACGCT CACCGGCTCC AGATTTATCA GCAATAAACC 8400 

AGCCAGCCGG AAGGGCCGAG CGCAGAAGTG GTCCTGCAAC TTTATCCGCC TCCATCCAGT 8460 

45 

CTATTAATTG TTGCCGGGAA GCTAGAGTAA GTAGTTCGCC AGTTAATAGT TTGCGCAACG 8520 

TTGTTGCCAT TGCTACAGGC ATCGTGGTGT CACGCTCGTC GTTTGGTATG GCTTCATTCA 8580 

50 GCTCCGGTTC CCAACGATCA AGGCGAGTTA CATGATCCCC CATGTTGTGC AAAAAAGCGG 8640 

TTAGCTCCTT CGGTCCTCCG ATCGTTGTCA GAAGTAAGTT GGCCGCAGTG TTATCACTCA 87 00 

TGGTTATGGC AGCACTGCAT AATTCTCTTA CTGTCATGCC ATCCGTAAGA TGCTTTTCTG 8760 

55 

TGACTGGTGA GTACTCAACC AAGTCATTCT GAGAATAGTG TATGCGGCGA CCGAGTTGCT 8820 

CTTGCCCGGC GTCAATACGG GATAATACCG CGCCACATAG CAGAACTTTA AAAGTGCTCA 8880 

60 TCATTGGAAA ACGTTCTTCG GGGCGAAAAC TCTCAAGGAT CTTACCGCTG TTGAGATCCA 8940 

GTTCGATGTA ACCCACTCGT GCACCCAACT GATCTTCAGC ATCTTTTACT TTCACCAGCG 9000 
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TTTCTGGGTG AGCAAAAACA GGAAGGCAAA ATGCCGCAAA AAAGGGAATA AGGGCGACAC 9060 

GGAAATGTTG AATACTCATA CTCTTCCTTT TTCAATATTA TTGAAGCATT TATCAGGGTT 9120 

5 ATTGTCTCAT GAGCGGATAC ATATTTGAAT GTATTTAGAA AAATAAACAA ATAGGGGTTC 9180 

CGCGCACATT TCCCCGAAAA GTGCCACCT 9209 

10 (4) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 bases 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

20 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

25 (ix) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

5' ATC ACA GAT CTC TCA CCA TGG ATT TTC AGG TBC AGA TTA TCA GCT 52 
TC 3' 2 

30 

(5) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

35 -■ (A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: yes 

45 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

5' TGC AGC ATC CGT ACG TTT GAT TTC CAG CTT 3' 30 

(6) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

55 



50 
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(A) LENGTH: 384 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



ATG 


GAT 


TTT 


CAG 


GTG 


CAG 


ATT 


ATC 


AGC 


TTC 


CTG 


CTA 


ATC 


AGT 


GCT 


TCA 


GTC 


51 


ATA 


ATG 


TCC 


AGA 


GGG 


CAA 


ATT 


GTT 


CTC 


TCC 


CAG 


TCT 


CCA 


GCA 


ATC 


CTG 


TCT 


102 


GCA 


TCT 


CCA 


GGG 


GAG 


AAG 


GTC 


ACA 


ATG 


ACT 


TGC 


AGG 


GCC 


AGC 


TCA 


AGT 


GTA 


153 


AGT 


TAG 


ATC 


CAC 


TGG 


TTC 


CAG 


CAG 


AAG 


CCA 


GGA 


TCC 


TCC 


CCC 


AAA 


CCC 


TGG 


204 


ATT 


TAT 


GCC 


ACA 


TCC 


AAC 


CTG 


GCT 


TCT 


GGA 


GTC 


CCT 


GTT 


CGC 


TTC 


AGT 


GGC 


255 


AGT 


GGG 


TCT 


GGG 


ACT 


TCT 


TAC 


TCT 


CTC 


ACA 


ATC 


AGC 


AGA 


GTG 


GAG 


GCT 


GAA 


306 


GAT 


GCT 


GCC 


ACT 


TAT 


TAC 


TGC 


CAG 


CAG 


TGG 


ACT 


AGT 


AAC 


CCA 


CCC 


ACG 


TTC 


357 


GGA 


GGG 


GGG 


ACC 


AAG 


CTG 


GAA 


ATC 


AAA 
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(7) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

• (A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



50 5 • GCG GCT CCC ACG CGT GTC CTG TCC CAG 3 ■ 
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(8) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: mxxaU 

(D) TOPOLOQY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: yes 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

5' GGS TGT TGT GCT AGC TGM RGA GAC RGT GA 3 ' 29 

(9) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: yes 

(iv) ANTI-SENSE: no 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

ATG GGT TGG AGC CTC ATC TTG CTC TTC CTT GTC GCT GTT GCT ACG CGT GTC 51 

CTG TCC CAG GTA CAA CTG CAG CAG CCT GGG GCT GAG CTG GTG AAG CCT GGG 102 

GCC TCA GTG AAG ATG TCC TGC AAG GCT TCT GGC TAC ACA TTT ACC AGT TAC 153 

AAT ATG CAC TGG GTA AAA CAG ACA CCT GGT CGG GGC CTG GAA TGG ATT GGA 204 

GCT ATT TAT CCC GGA AAT GGT GAT ACT TCC TAC AAT CAG AAG TTC AAA GGC 255 

AAG GCC ACA TTG ACT GCA GAC AAA TCC TCC AGC ACA GCC TAC ATG CAG CTC 305 

AGC AGC CTG ACA TCT GAG GAC TCT GCG GTC TAT TAC TGT GCA AGA TCG ACT 357 

TAC TAC GGC GGT GAC TGG TAC TTC AAT GTC TGG GGC GCA GGG ACC ACG GTC 403 
ACC GTC TCT GCA 420 
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